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Abstract 

In this paper, the problem of constructing an efficient quantum circuit for the implementation of 
an arbitrary quantum computation is addressed. To this end, a basic block based on the cosine-sine 
decomposition method is suggested which contains I qubits. In addition, a previously proposed quantum- 
logic synthesis method based on quantum Shannon decomposition is recursively applied to reach unitary 
gates over / qubits. Then, the basic block is used and some optimizations are applied to remove redundant 
gates. It is shown that the exact value of I affects the number of one-qubit and CNOT gates in the proposed 
method. In comparison to the previous synthesis methods, the value of I is examined consequently to 
improve either the number of CNOT gates or the total number of gates. The proposed approach is further 
analyzed by considering the nearest neighbor limitation. According to our evaluation, the number of 
CNOT gates is increased by at most a factor of | if the nearest neighbor interaction is applied. 



1 Introduction 

The promise of exponential speed up of quantum algorithms [l}j3] running on quantum computers has inten- 
sified the attempts for using quantum algorithms in real world problems. Besides that, the ability of quantum 
computation in simulating quantum-mechanical effects (QMEs) further increases its attraction. Classical com- 
putation has inherent limitations to simulate QMEs Gj. Due to the ability of quantum computation in solving 
several applications efficiently [5], numerous efforts have been made to study various aspects of quantum 
computation and a set of quantum technologies has been proposed to build a scalable quantum computer [6] . 

Aside from different technological characterizations, quantum computations provided by all of the proposed 
quantum technologies can be described in terms of linear operators (i.e., matrices) in Hilbert space. In this 
sense, any quantum computation involves evolution of an initial quantum state under a series of elementary 
unitary transformations [7 -10 . It has been shown that any unitary transformation can be exactly realized 
if the set of single qubit operations plus CNOT are allowed as the elementary gates [9 11. As CNOT needs 
interactions between two qubits which are usually much weaker than the interactions between a single qubit 
and classical control fields, producing fewer CNOT gates is more favorable generally. At the same time, 
reducing the number of one-qubit gates to improve the total synthesis cost is an active research field. 

Synthesis of an arbitrary unitary matrix from a universal set of gates including one-qubit operations and 
CNOTs has a rich history. Barenco et al. in 1995 [9] showed that the number of CNOT gates required to 
implement an arbitrary unitary matrix over n qubits was 0(n 3 4"). In 12 , it was shown that applying the 



QR decomposition of matrix algebra could lead to the same result. In 1995, Knill reduced the number of 
CNOT gates to 0(n4 n ) [13]. In 2004^ Shende et al. found the highest known lower bound on the number of 
CNOT gates as \\{A n -_3n - 1)] 



The process of reducing the number of CNOT gates was continued 
ed to 0(4") CNOT gates. Later, a decomposition method based on the 



14 



by Vartiainen et al. in [15| which 
cosine-sine matrix decomposition, called CSD, was proposed which produced 4™ — 2™ +1 CNOT gates 
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1 CNOT gates. Next, Mottonen and Vartiainen in 
I by improving the results of 
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reduced the number of CNOT gates to 
. By using the idea oFquantum Shannon decomposition (QSD) 
and some circuit identities, the authors of |2"0]reached the same number of CNOT gates. In 21 , an algorithm 



was proposed that translated a unitary matrix into a quantum circuit according to the KAK decomposition 
in Lie group theory. Their results showed that any matrix decomposition corresponding to type-AIII KAK 



decompositions could be derived according to the given Cartan involution by applying the method of 21 
Most recently, Drury and Love placed the QSD into a Lie algebraic context. Their results showed that QSD 
is an alternating series of Cartan decompositions [22] . 

Besides the significant attempts made in improving the worst-case number of CNOT gates and one-qubit 
gates required to implement an arbitrary unitary matrix, efforts have been made to improve the number of 
one-qubit gates and CNOTs for specific matrices with applications in the synthesis of an arbitrary matrix. It 
has been shown that three CNOT gates are necessary and sufficient to implement an arbitrary 2-qubit unitary 
matrix in the worst case 23 , 24 , 25 . In addition, seven one-qubit gates are reported to be sufficient by 
three CNOT gates for implementing an arbitrary 2-qubit unitary matrix 23 . In 
B. _ , , 

On the 
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a new quantum gate, 

was introduced and applied to synthesize a generic two-qubit operation. It has been shown that two B 
gates with at most six single-qubit gates implement any arbitrary two-qubit quantum operation 
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other hand, a decomposition algorithm which leads to 40 CNOT gates was proposed in [27] for an arbitrary 
quantum circuit with 3 qubits 
matrix requires 20 CNOTs 
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Currently, it appears that the best known quantum circuit for a 3-qubit 
whereas the lower bound is 14 



14 . The synthesis of an n-qubit Toffoli gate 



was studied in 28 where the authors showed that an n-qubit Toffoli gate requires at least 2n CNOT gates 
and for n — 3, their CNOT-cost (i.e., 6) is optimal. On different directions, synthesis of a diagonal matrix 
and optimal realization of controlled unitary gates were studied in 29 and 30 , respectively. The problem of 



preparing an arbitrary quantum state starting from a given state was discussed in some papers 17 20 31 

In this paper, the synthesis of an arbitrary unitary matrix over n qubits is addressed. To this end, the 
QSD method is used to synthesize a given unitary transformation first. Then, by determining an appropriate 
decomposition level, unstructured unitary matrices in the resulted decomposed circuit are replaced with blocks 
synthesized by CSD method equipped by some optimizations. We show that the decomposition level can be 
selected to trade off the number of CNOT gates against one-qubit gates in the proposed decomposition method. 

The remainder of the paper is organized as follows. In Section [2] basic concepts are explained. The 
proposed synthesis approach is introduced in Section [3] followed by considering the number of CNOT and 
one-qubit gates resulted from the proposed approach in Section [4] The nearest neighbor implementation of 
our synthesis method is evaluated in Section [5] and finally, Section [6] concludes the paper. 



2 Preliminaries 

In this section, the basic concepts and notations used in the rest of the paper are introduced and explained. 
2.1 Quantum Bits 

A quantum bit, qubit in short, like its classical counterpart can be realized by a physical system such as a 
photon. In this paper, we treat a qubit as a mathematical object which represents a quantum state with two 
basic states |0) and |1). A qubit can get any linear combination of its basic states, called superposition, as 
shown in Q where a and /3 are complex numbers. 

|V>)=a|0)+/3|l) (1) 

Although a qubit can get any linear combination of its basic states, when a qubit is measured, its state 
collapses into the basis |0) and |1) with the probability of |a| 2 and |/3| 2 , respectively and we have |a| 2 + |/3| 2 = 1. 
It is also common to denote the state of a single qubit by a 2 x 1 vector as [a /3] in Hilbert space H where 
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superscript T stands for the transpose of a vector. A quantum system which contains n qubits is often called 
a quantum register of size n in the context of quantum computation. More precisely, an n-qubit quantum 
register can be described by an element \tp) — \ipi) \ip 2 ) <8> ■ ■ • <£> \if> n ) m the tensor product Hilbert space 
H = Hi ® H 2 ® • • • ® H n . 



2.2 Quantum Gates and Circuits 



An n-qubit quantum gate is a device which performs a specific 2" x 2" unitary operation on selected n qubits 
in a specific period of time. A matrix U is unitary if LlW = I where is the conjugate transpose of U and 
I is the identity matrix. An arbitrary unitary gate over n qubits with a generic 2™ x 2™ matrix is represented 
as U[2 n ) in this paper. Previously various quantum gates with different functionalities have been introduced. 
For example, the 6 rotation gates (0 < 9 < 2ir) around the x, y and z axes acting on one qubit are defined as: 
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A square matrix is called diagonal if the entries outside the main diagonal are all zero. In other words, 
an n x n matrix A = [Si j) is diagonal if 6i j = 0, i ^ j, 1 < i, j < n. For example, R z {9) gate is a diagonal 
2x2 matrix. A generic diagonal gate over n qubits is represented by a 2" x 2™ unitary diagonal matrix. A 
diagonal gate over n qubits is denoted as A n throughout this paper. 

A block matrix is a partition of a matrix into rectangular smaller matrices called blocks. A block diagonal 
matrix is a block square matrix whose main diagonal blocks are square matrices. Hence, the blocks outside 
the main diagonal are zero matrices. A block diagonal matrix has the form shown in ^ where each Ui[2 m ) 
is a 2 m x 2 m matrix. The inverse of a block diagonal matrix is also block diagonal which is composed of 
the inverse of each block. A quantum gate B^(U[2 m )) over n qubits with m targets denoted by the set r 
and s — n — m select qubits on the most significant qubits has a block diagonal matrix 20 represented by 
B(U(2 m )) in this paper. In the case of only one target line (i.e., m = 1), the specific target line is shown 
instead of the set t. From the quantum circuit point of view, for a gate J3*(t/(2 m )) over n qubits with matrix 
B(U{2 m )) shown in Q, the matrix of Ui(2 m ) (1 < i < 2 n ~ m ) is applied to the target qubits where the index 
i depends on the values of s = n — m select qubits |20j . In this paper, a select qubit in a circuit is denoted by 
□ as in [20]. 

/ C/i(2 m ) ••• \ 

U 2 (2 m ) ■■■ 



B(U[2 m )) 



\ 



U 2n -m[2 m ) j 



The unitary matrix implemented by several gates acting on different qubits independently can be calculated 
by the tensor product ® of their matrices. Two or more quantum gates can be cascaded to construct a quantum 
circuit. For a set of k gates g±, g 2 , gk cascaded in a quantum circuit C in sequence, the matrix of C can 
be calculated as MfeMfc_i...Mi where Mj is the matrix of the i th gate (1 < i < k). 

Quantum circuits are often synthesized using a "basic gate" library [9] which contains CNOT and one- 
qubit gates. In contrast, an "elementary gate" library was used in [29] which contains CNOT and one-qubit 
rotation gates. The gate CNOT acts on two qubits (control and target) where the state of the target qubit is 
inverted if the control qubit holds the state |1). A CNOT gate with control c and target t is denoted as C c ' t in 
this paper. Since every gate in the elementary library is a basic gate and every basic gate can be decomposed 



into at most three elementary gates (see Section 2.3), the fi(gate count), 0(gate count), and ©(gate count) in 
either gate library are identical. In this paper, the basic gate library is applied to synthesize a given unitary 
matrix as discussed in Section [3l 
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2.3 Quantum Circuit Decomposition 



Two quantum circuits are equivalent if matrix products of their gates are identical. In order to synthesize a 
given unitary matrix, equivalent circuits may be applied to simplify the circuit. To do this, various quantum 



circuit identities have been proposed in recent years (for examples see 32 , 11 , 33 , [20]). 

An arbitrary one-qubit gate U(2) can be decomposed into R z and R y rotation gates (called ZYZ decom- 
position) as shown in ^ [9] . Hence, a one-qubit computation from the basic gate library can be implemented 
as a sequence of at most three gates from the elementary gate library. 

U{2)=R z {a)Ry{p)R z {\) (3) 

Cosine-sine decomposition [34] for a 2" x 2" unitary matrix U can be expressed by Q where L\, L 2 , Rx, 
and i?2 are unitary 2™ _1 x 2 n ^ r matrices and C and S are unitary 2"~ 1 x 2™ _1 diagonal matrices with real 
elements such that C 2 + S 2 = l n —\ {Jn-i is the identity matrix over n — 1 qubits). 



u -{o l,){-s c){ o k) < 4 » 

In the following, two quantum synthesis algorithms which are based on the cosine-sine decomposition 
method are described and used in the rest of the paper. 

2.3.1 CSD 



In 16 , a decomposition algorithm, called CSD, was proposed which uses the cosine-sine decomposition to 



decompose a generic U(2 n ). The CSD decomposition was further improved in 17 where the cosine-sine 
decomposition is recursively applied on each block diagonal gate. This process is stopped when there are 
only one-qubit block diagonal gates. More precisely, the CSD decomposition of a generic U(2 n ) matrix can 
be expressed by ^ where is the ruler function introduced in |35| is the position of the least 

significant nonzero bit in the binary representation of i). 



U{T)=B--\U{2)) H Bl-_ 1 ^(R y )Bl- 1 {U{2)) (5) 



=i 



Each block diagonal gate that appears in ^ can also be decomposed into a circuit which contains three 
block diagonal gates of smaller sizes and one CNOT gate [l7j. Equation ^ shows the decomposition for a 
B?-\U{2)). 

B?-\U(2)) = B:- 2 (U(2))C 1 ' n B n n - 2 (U(2))Br 1 (R z ) (6) 
Applying the recursive decompositions given in ([61 leads to a circuit with 2 rl_1 — 1 CNOTs and 2™ _1 



one-qubit gates followed by a diagonal gate A n where A n can also be decomposed into 2™ — 2 CNOTs 29 
Figure [I] illustrates the result of [17] for a block diagonal circuit with 4 qubits. 
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Figure 1: Decomposition of a block diagonal gate of size 4 uj\. Each U is a generic U{2) gate. 
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In addition, the authors of 17 used the notation given in ^ and converted ^ to ^ by replacing Q in 
([5]) and merging each A„ with its next block diagonal gate. Note that the notation ~ over a block diagonal 
gate means that this gate can be implemented up to a diagonal gate. Doing this could reduce the number of 
gates resulted from the decomposition of a unitary gate. 



U(2 n ) 



B^ 1 (U(2)) = A n B^(U(2)) 

2 »-i_i 



(7) 



(8) 



2.3.2 QSD 

Since the left and the right matrices in Q are block diagonals, they can be represented as block diagonal gate 
B*({/(2™ -1 )). In addition, the middle matrix in Q can be represented by B^ l _ 1 (R y ). Therefore, (Jij) can be 
rewritten as 

U(2 n ) = B 1 T (U(2 n ~ 1 ))B[ l - 1 (R y )B 1 T (U(2 n ' 1 )) (9) 
The authors of [20] showed that each block diagonal gate B} r {U{2 n - 1 )) in M can be decomposed into two 



generic unitary matrices and a specific block diagonal matrix as illustrated in (10) and (11) 

B([/(2"- 1 )) = 
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B 1 T {U{2 n - 1 )) = U(2 n - 1 )B[ l - 1 (R z )U(2 n - 1 ) 



(10) 



(11) 



By using ^ and ( 11 ), the method of [20j applies the quantum Shannon decomposition (QSD) shown in ( 12 ) 
to synthesize a given unitary matrix. The recursion is continued till U(4). Next, an optimal decomposition 
of t7(4) with three CNOT gates (23| is used. 



£7(2") = U(2 n - 1 )B[ l - 1 (R z )U(2 n ^ 1 )B[ l - 1 (R y )U(2 n - 1 )B[ l - 1 {R z )U(2 n - 1 ) 



(12) 



The complete circuit diagram of Q is shown in Fig. |] where B"~ 1 (R Z ) and B"-\R y ) gates are illustrated 
by dashed lines. In this paper, we use slash to denote that a given line may carry an arbitrary number of 
qubits. In addition, for a circuit with n qubits, circuit lines are numbered from 1 (top) to n (bottom). 
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Figure 2: Applying QSD 23] on a U(2 n ) matrix 



3 The Proposed Synthesis Method 

In this section, a synthesis approach for quantum circuits is introduced. The proposed approach is based on 
the CSD and QSD methods. 
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3.1 The Basic Block 



In this part, a basic block is introduced which is used in the proposed decomposition method. Consider a 
generic unitary matrix over I qubits and the CSD method is used to synthesize it as discussed in Section [2.3.1| 
The resulted circuit without the last A; gate is considered as the basic block in the proposed decomposition 
method. The proposed basic block is shown in (131 denoted as Q(2 l ). The complete equation for Q(2 l ) is 
given in (14). The CSD decomposition for U(2 3 ) and our basic block are illustrated in Fig. [3] The rightmost 
sequence of block diagonal R z gates corresponds to a single A 3 gate. 



U(2 l ) = A ; Q(2 ; ) 



(13) 



Q(2 l ) = B\-\U(2)) J] B l -\ [t) {U(2))B\-\U{2)) 



(14) 
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Figure 3: The proposed basic block for 3 qubits. 



3.2 Block-Based Quantum Decomposition (BQD) 

In this subsection, the proposed decomposition method is introduced. The following proposition is used in 
the rest of this section to remove unnecessary gates. 

Proposition 1 (A; ® In-ijB^ 1 (R k ) = B"^ 1 (R k )(Ai ® /„_;) where I < n — 1 and fc = z,y. 

As there is no overlap between the target line of B™~ l {Rk) and A; for I < n — 1 and k = z,y, Proposition [l] 
is held. Figure |4] illustrates the result of Proposition [T] on n qubits for A„_i. 



Bl- 1 (R k ) Bl- 1 (R k ) 




Figure 4: Application of Proposition [T] on n qubits. 
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Reconsider the QSD method shown in (12). It can be verified that the recursive application of (12 1 up to 
I qubits leads to (15 1 where each U(2 l ) is denoted as U to save space. 



I7(2«) = I/S^! (fl,)!/^.! (J2„ )L7'^_ I (Ji,)!/^:^! (fl:,)...^^.! (i^)...^"^-! 
B^- 1 {R z )...Bl' 1 {R y )...B r r 1 {R z )... 



(15) 



Based on ( 13 1, each U(2 l ) in ( 15 ) can be replaced by AiQ(2 l ). The result is given in ( 16 ) where each Q(2 l ) 
is denoted as Q. 



U{2 n ) 



AiQB l n _ l (R z )AiQB\ l _ l (R y )AiQB l n _ l (R z )A l QB l +} l _ 1 (R z )„ 

B[ l - 1 (R z )...B[ l - 1 (R y )...B[ l - 1 (R z )... 

B l +} l _ x {R z )...B l £ l _ 1 (R y )...B^ 



(16) 



Now, direct applying of Proposition [l] on ( 16 ) moves all A/ gates to the right side of block diagonal gates. 
Then, each A/ gate can be merged with its adjacent Q(2 l ) gate which leads to a U(2 l ) gate. The resulted 
U{2 1 ) gate can also be replaced by A;Q(2 ( ). Continuing this process leads to (17). 



[7(2" 



A l QB^ l (R z )QB^ l (R y )QB^ l (R z )QBl+_\_^^ 



? z+i 



3 n-l 
?Z+1 



B r r L (R z ). 



.sr 1 ^)...^- 1 ^). 



(17) 



Based on the above equations, the proposed block-based quantum decomposition (BQD) method is given 
in ( p~8| ) where each Q(2 l ) is denoted as (19). The circuit diagrams of (18) and (19) are illustrated in Fig. [H] 
and Fig. [6j respectively. Figure [7] shows the result of BQD method for U(2 e ) where I = 4. 



U(2 n ) = A / g(2"- i )i? 1 (i? z )g(2"- i )S 1 {R y )${2 n ~")B 1 (i? z )g(2"- i ) 



c( is _ f g(2' i - 1 )sr 1 (i? z )g(2- 1 )i?r 1 (i?,)g(2- 1 )sr 1 (i? z )g(2' i - 1 ) / < i < n 

^ 2 >-\ Q{2 1 ) l = l 



(18) 



(19) 



Note that only one A; exists after final decomposition stopped at level I. To determine I, i.e., decomposition 

U(2 n ) B?- 1 ^) B^-^Ry) B?-\R X ) 



/=T¥1 ■ 





~rT\ 
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g( 2 — 1 ) 



g( 2 — 1 ) 




0(2" 



0(2" 



Figure 5: The proposed decomposition method 
level, number of CNOT gates and one-qubit gates should be evaluated as done in the following subsections. 
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Figure 6: The circuit diagram of (19) 
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<3(2 5 ) Q(2 5 ) g(2 5 ) g(2 5 ) 

Figure 7: Decomposition of a C/(2 6 ) using BQD method. Q(2 4 ) is denoted as Q. 



3.3 Synthesis Cost 
3.3.1 CNOT cost 



Equation (20) shows the recursion formula for the number of CNOT gates for the circuit of (19) where 2 1 " 1 
is the number of CNOT gates for each R z gate 20 . In addition, the block diagonal R y gate in Fig. [6] can be 
transformed in the decomposition into a block diagonal U(2) rotation gate up to a diagonal gate. As a result, 



it can be implemented by one less CNOT compared with the block diagonal R z gate 19 , 20 

3 x 2 1 - 1 



4C, 

C c 



C 4 _ a 



1 



I < i < n 
i = I 



Applying the recursion formula of (|20|) leads to (|21 ) 
C, 



C, 



Al — l 



(C Ql +3x2'- i )-3x2 



3 



I < i < n 



(20) 



(21) 



Now consider the proposed BQD method shown in (18). Equation (22) shows the number of CNOT gates 
in the proposed method where 2 l — 2 is the number of CNOT gates for A; 



2!) 



c. 



c n 



(22) 



One more CNOT gate can be eliminated in Q(2 ) A; 17 . This comes from the fact that Q(2') A z can 
be written as a sequence of B l ~ 1 {U{2)) gates, where i G {1,1 + 1, ...,n — l,n}, followed by a chain of block 
diagonal R z gates (see Fig. [3]). The single rotation gate in the chain can be absorbed into a U(2) gate 
appeared after the decomposition of B 1 ^ 1 (U (2)) gate. Then, the sub-circuit I ® U(2)C 1,2 I <g> U(2), resulted 
from the decomposition of B l ~ 1 {U{2)) gate, followed by a B^ l _ 1 (R z ) gate constructs a B*(U(2)) gate, which 
can be implemented by two CNOT gates [20], one less than the number of CNOT gates required for the 
implementation of/® U(2)C 1,2 I (8) U(2)B} t _ 1 (R z ) sub-circuit. Equation (23) shows the final result. 



Cr 



An— I 



(C Ql + 3 x 2'- 1 ) -3x2 



(23) 
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The number of CNOT gates for Q(2 l ) (see ( 14 ) ) can be computed by (24) where 2' — 1 is the number of 
block diagonal gates in Q(2 l ) each of which can be implemented by 2 i_1 — 1 CNOT gates [17] . 



C'c 



{2 l - 1) x (2 



l-i 



1) 



(24) 



Altogether, the number of CNOT gates for the proposed BQD method is shown in ( 25 ) for the decompo- 
sition level I. 



Cr 



4™( 



2 1 

j + -) - 3 x 2™- 1 

3 x 4 l 2 1 



(25) 



3.3.2 One-qubit cost 

To count the number of one-qubit gates in the proposed decomposition method, an analysis similar to the one 



performed for CNOT cost can be used. Equation (26) shows the number of one-qubit gates for the proposed 
BQD method where 2 x 4 3 -1 is the number of one-qubit gates merged between B(R y ) and B(R Z ) gates 
at each recursion step. 2 l — 1 is the number of one-qubit gates needed for diagonal gate A;. Besides, I more 
gates can be eliminated at the last U(2 l ) gate decomposed using CSD method 



19 



O, 



4"-'(0 Qi +3x2'- i )-3x2 



(2 x 



1 71 — 1 



) + {2 l -l)-l 



(26) 



Number of one-qubit gates for Q(2 ) is computed by (27) 



Ql = (2 l - 1) x (2'- 1 - 1) 



Replacing (27) in (26) and doing some simplifications lead to (|28j) for decomposition level I. 

-2 



2* - 1 - 



1 



(27) 



(28) 



4 Result Comparison 

In this section, the results of the proposed decomposition approach is presented. To this end, the effect of 
the decomposition level on the synthesis cost in the BQD method is discussed first. Table [T] and Table [2] 
show the number of CNOT gates and one-qubit gates for 4 < n and 3 < I < n up to n = 12, respectively. 
Additionally, the total number of gates in BQD method for each decomposition level is shown in Table [3| 
Note that the synthesis results for I = n in the proposed decomposition method (gray cells) are the same as 
the ones produced by the improved CSD method (i.e., 17 ). The results of the QSD method 20 are also 
shown in these tables. 

The results given in Table [TJ Table [2] and Table [3] reveal the following facts regarding the behavior of the 
BQD method compared to the other approaches: 



• In terms of the number of CNOT gates, QSD and 19 work always better than BQD and BQD works 
better than CSD. 

• In terms of the number of one-qubit gates, improved CSD is better than BQD and BQD is better than 
QSD and 



19 all the time. Similar result is obtained in terms of total number of gates. 



As can be seen in Table [3j in I = n — 1, the total number of gates produced by BQD is one gate more 
than the number of gates produced by improved CSD. However, in those cases, BQD produces fewer 
CNOT gates. Since CNOT cost is usually much more than the cost of one-qubit gates, BQD produces 
better circuits compared to the other methods as far as the total number of gates and technological 
limitations are concerned. The appropriate I is n — 1 for this case. 



9 



Table 1: Number of CNOT gates in the proposed BQD, improved CSD [P7] (gray cells), and QSD [20] methods 
for 4 < n < 12. 
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Table 2: Number of one-qubit gates in the proposed BQD, improved CSD [17] (gray cells), and QSD 20 
methods for 4 < n < 12. 
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While QSD and 19 lead to better CNOT cost compared to improved CSD and BQD, if one-qubit gate 
count is also concerned, BQD produces about 5% more CNOT gates and about 28% fewer one-qubit 
gates compared to QSD on average. Altogether, the total number of gates is improved by about 16% 
in BQD compared to QSD on average. The best results in terms of the number of CNOT gates for 
each n are boldfaced in all tables. In some cases, there are two different decompositions which have the 
same number of CNOT gates and different numbers of one-qubit gates. In those cases, decompositions 
with fewer one-qubit gates are selected. Percentages of improvement (BQD vs. QSD) for CNOT gates, 
one-qubit gates, and total gates are shown in Fig. [8] for this case. 

A comparison of the results produced by BQD and improved CSD reveals that for this case, BQD leads 
to fewer CNOT gates with the penalty of more one-qubit gates. Percentages of improvement (BQD vs. 
improved CSD) for CNOT gates and one-qubit gates are almost equal (with a negative sign). However, 
fewer CNOT gates is more desirable as discussed. 

Producing fewer CNOT gates leads to better quantum circuits considering the current technological 



limitations. To find the exact gate counts in the proposed method, we set derivation of (251, i.e., the 
number of CNOT gates for BQD method, with respect to / to zero which leads to I = n + lo ^W 3 ) _ 
According to the results given in Table [I] and Table [2] it can be verified that I — \2n/3] leads to the 
achieved results. Therefore, the number of CNOT gates and one-qubit gates in the proposed BQD 



method can be calculated as ( 29 ) and (130| , respectively. 
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Table 3: Total number of gates in the proposed BQD, improved CSD [17] (gray cells), and QSD 20 methods 
for 4 < n < 12. 
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Figure 8: Percentages of improvement for CNOT, one-qubit and total gates (BQD vs. QSD) 
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(30) 



According to the above discussion, we select / = [2n/3] in the proposed decomposition method. Table [4] 
and Table [5] show the number of CNOT gates and one-qubit gates for different quantum synthesis methods. 
Considering the fact that number of CNOT gates in the best reported synthesis methods 



19 20 



is away 

from the lower bound by a factor of two, more improvement in the synthesis cost needs consideration of both 
CNOT and one-qubit gates. Simultaneous reduction of CNOT and one-qubit gates has also been recently 
pursued by other researchers 



2.x 



5 Nearest Neighbor Implementation 

While several impressive physical realizations have been proposed for quantum computers, all of these tech- 
nologies have serious intrinsic limitations which should be resolved in future [6j . Among the different techno- 
logical constraints, the limited interaction distance between qubits is one of the most common ones. Although 
arbitrary-distance interaction between qubits is possible in quantum computer technologies with moving qubits 
(for example in a photon-based system 



36 



), some restrictions exist in other quantum technologies. Indeed, 
many physical quantum computer proposals only permit interactions between adjacent (nearest neighbor) 
qubits 



37 . For example, liquid nuclear magnetic resonance (NMR) [38], and the original Kane model [39] 



were designed based on the interactions between linear nearest neighbor (LNN) qubits. 
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Table 4: Comparison table for the number of CNOT gates in various quantum synthesis methods 
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Table 5: Comparison table for the number of one-qubit gates 
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Eq. 1 30 



In this section, the worst-case cost of BQD synthesis method for nearest neighbor architecture is studied. 
To this end, the structure of the proposed basic block, the block diagonal and the diagonal gates under LNN 
constraints should be evaluated. Reconsider the proposed basic block Q(2 l ) which has 2 l — 1 block diagonal 

Equation (31) shows the number of adjacent CNOT gates for a block diagonal gate with I — 1 control 

-i- ' 
2 



gates. 

lines where s is between 1 and |~|] 



17 



5rJ 



21 




(31) 



A comparison between the number of CNOT gates required to implement a block diagonal gate without 
the LNN constraint (i.e., 2 l ~ 1 — 1) and with this constraint (i.e., 0(|2')) reveals that the number of CNOT 
gates for a block diagonal gate is increased by a factor of | in the LNN architectures. 

Equation (32) shows the number of adjacent CNOTs of a block diagonal gate with n — 1 
where target is at the I s * line (without the last A n gate) in the LNN architectures 
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control lines 
The block diagonal 

R y gates produced in the QSD method can also be implemented by the same number of adjacent CNOTs as 
discussed in Subsection I3JJJ 



Cv 



5nti 
6 Z 



2n- 



3 
23 
3 



Odd 



(32) 



Equation (33) shows the number of adjacent CNOTs required for block diagonal R z gates 17 which have 



been produced during QSD decomposition. 



C 



nr 1 



5 o" 



3n — 



22 



~ n odd 



(33) 



A comparison between the number of CNOT gates required to implement a block diagonal gate without the 



LNN constraint (i.e., 2 r 



1) and with this constraint (i.e., 0(|2™)) reveals that the number of CNOTs for 



a block diagonal gate is increased by a factor of | in the LNN architectures. 

On the other hand, the last diagonal gate can be considered as a cascade of block diagonal rotations 
with i € {1, • • • ,1—1} controls and one target placed at the 1 st qubit [29]. Therefore, the nearest neighbor 
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implementation of A; can be computed as ( 34 ) where Ca, shows the number of CNOT gates in Aj for LNN 
architectures. Equation (34) reveals that Ca ; is of 0(|2 Z ) for LNN architectures. 



( 




'^-l - 4r n even 

6 3 



(34) 



An exact evaluation of the above costs shows that applying LNN constraints increases CNOT count by at 



6 Conclusions and Future Directions 

In this paper, a decomposition approach for quantum-logic synthesis was proposed which is based on the 
previous CSD and QSD methods. To do this, a basic block based on CSD method was constructed which 
contains I qubits. Next, the proposed circuit for a generic unitary gate over I qubits was used as a basic block to 
present our quantum synthesis method. In doing so, the previously proposed QSD method for quantum-logic 
synthesis was recursively applied to reach unitary gates over I qubits. Then, the proposed basic block was used 
and further optimizations were applied to remove redundant gates. To evaluate the proposed method, number 
of CNOT and one-qubit gates with and without the nearest neighbor constraint was analyzed. According 
to our analysis, the decomposition level I can be selected to trade off the number of one-qubit gates against 
CNOT gates. Since producing fewer CNOT gates is more desirable, the decomposition level is set to [2n/3] 
in the proposed decomposition method. 

Due to the potential of the proposed decomposition method in consideration of the number of CNOT and 
one-qubit gates, it is our hope that the proposed synthesis method may lead to further improvement in the 
number of CNOT and one-qubit gates simultaneously in future. 
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